LinkedIn Courses Notes

https://www.linkedin.com/learning/learning-data-analytics-1-foundations/l

Bad SQL introduction :/
She sucks at explaining SQL introduction. She didn't even mention tables, schema, is it on row, or column wtf.

She was good at explaining how a data analyst thinks when looking at spreadsheets

Oh wow, so this course isn't all about getting you started as being a data analyst
BUT MAKING YOU SEE what a data analyst do, how they solve problems,
It's basically a comprehensive preview of what a data analyst do in an organization
Tho, In a bad way. Course is not good toward beginners even though the course design was targeted towards beginners.

I'm abandoning this, it's not helpful.
There's two types of shits, bad shit and good shit. It's a good shit in some perspective and use cases
But for my circumstances and I don't recommend it towards

  • Beginners at Tech
  • Beginners in Data Analytics
  • Those who have programming experiences

Introduction to data and charts (linkedin.com)
Let's learn some statistics first

Finally G! FINALLY! I understood why we use mean, median, mode
Wtf G, these are just starting points when we don't know the data inside of our datasets
Like, a way for us to gauge and guess what's inside of thousands of data in a dataset
If you can understood the middle point of a data, then you could estimate and gauge the inside of it
Problem: You want to gauge what's inside of a dataset
Solution: Find the center of the data set
-> See it this way, you have a data set, 3, 1, 20, 4, 5
-> IMAGINE YOU DON'T KNOW what's inside of this
-> So to get a better gauge as to what numbers are in there, you FIND THE CENTER

It's a guess to another guess to another guess to another guess, to another question to a guess, Oh I might like playing with data. It's a playground for inquisitive endless type of curiosity and the satisfaction of discovery. But we'll see :)

Finding the Center of a Dataset

Mean, Median, Mode

n is how many values there are
Mean: (sum_all_values) / n Basically the average
Median: (n + 1) / 2 The middle position of a dataset in ascending/descending order
Mode: none Basically the most repetitive number in the dataset

MEDIAN FACTS
The median never changes its value regardless if its ascending/descending order.
Unlike the mean or mode, which can change with the rearrangement of data, the median remains constant as long as the middle value(s) remain the same.

Well what happens if it results into 10.5? So there's no true middle value since the dataset is even
So it's saying the middle value is in the middle of 10th position and 11th position

The median divides a dataset into two equal parts.
Think about median as the way for you to half a dataset in respect with the neighbhouring middle values

What is the median for this data set with six values?
30,40,40,60,80,120
Median: 50, But the median doesn't exist in the dataset

Bruv, look at this. So if we have a dataset of 1, 2, 3, 4, 20
Mean = 15
Median = 3
So, we can form a guess that the dataset has a big gap between the large number and small number
The bigger the difference between mean and median, the bigger the gap is between large and small number. Something fishy must've happen in here as to why there is such a huge gap.


Variability

Wtf is this, "range" makes sense
Now we have a bell shaped (let's be honest, it doesn't look like a bell) thing, a z score, a standard deviation. He didn't explained it well for someone who is new to these things lol